Accession Number |
TCMCG020C05238 |
gbkey |
CDS |
Protein Id |
RAL50882.1 |
Location |
complement(join(509877..509915,510326..510610,511493..511610,512043..512104,512191..512316,512927..513127,513947..514177,514315..514469,514597..514690,514986..515208,516236..516309,517564..517655,518289..519059,519463..519918,520717..520816,521202..521375,521454..521642)) |
Organism |
Cuscuta australis |
locus_tag |
DM860_005238 |
Length |
1129 aa |
Molecule type |
protein |
Topology |
linear |
Data_file_division |
PLN |
dblink |
BioProject:PRJNA394036, BioSample:SAMN07347267 |
db_source |
NQVE01000054.1 |
Definition |
hypothetical protein DM860_005238 [Cuscuta australis] |
Locus_tag |
DM860_005238 |
CDS: ATGGAGTTTGAGGTGCGAGTGGTAGGGGGAATTGAGAGCTGCTTCGTTTCACTTCCTCTTCCACTGATTCAAACCCTTCAGTCCCGTTACCTCCCTCCGATTCTCGCAATCGAACTCCGCTCCTCCGAGGGACGCCTCTGGCACGTGTCCTGGTGCGGCTCCGTTTCCTCTTCCCCTTCATCCATTGAGATTGCTCGCCAATATGGGGGGTGTATCGGATTGAGTGATGGCATGGGGGTCCGGGTGCGGGTGGTGAGCAACTTGCCCAAGGCTACACTGGTGACCATAGAGCCACTTACGGAGGACGATTGGGAGATTCTGGAACTTAATTCGGAGCTTGCTGAAAATGCTATTCTGAAGCAGGTTGGAATCGTTCACGAAGATATGAGATTTCCATTGTGGTTGCATGGACAAACCGTTGTCATGTTTCTTGTCACGTCAACCTTTCCACAAGAACCAGTAGTCCAACTTGTACCCGGAACAGAAGTTGCAGTTGCTCCGAAAACTCGTAGAACTTCATCAATCCAGCCTCCTGAGCCGGAACATGCAATGACTAAAGTATTACTTCGCGTTCAGGATTCAGACAGTAGATTCATCTGTAAGCACGATGTTAATGGTGTCAATGTGGATATCTTGTTTACTTCTGGAATTTTCATCCATCCGGAAACAGCCAAAAATTATTCATTGAGTTCTCTTCAATTTGTGGTGATATCTCCTCGAGTACTCTCCAAAAAGAGCAACAAGAATCCACATTCTAAAACGAGTGCAACCGAAAAAGAATCCAACAATGGAAATCTTTCCAATGAAAAGGGTTTCCATGAAGTTCTTTTTCGTATATTGTTGTCCGAGTCTGTGGCAAAAGGACATGTAATGCTATCTCAGTCGCTCCGCCTCTATTTGGGTGCAGGACTTCACTCATGGGTATATTTGAAAAGGTGCAAAATTATAATTAAAAAAGACATCCCTCTTGTTTCAATTGCTCCTTCCCATTTCAAAATTTTTAAAAAAGATGACGTTGACAAGAGCAGTCTAGATATTGTAAACAACCACGAGAACCATTTACAAAAAGATGAGCTACGGAGAAGCAGTTCCAATGCTGAAATGGGTATCAGGGACTGGTCAATGCACGAGAAGATTGTTAAAGTTTTTTCTTCTTTAACCTCCTTTGCTGGAGCGGAAGAAACAACTACTACAATTGGAAAAACAAATACTGAAAATGGCTTTGTAAGTGGTATATCTACTCTTTTACGTGCATGGTGCTTTGCTCAGCTCAATACTGTTGCTTCAAATGCAGCAGATGTCAGTTCACTGGTTATTGGAAGCAAATCGTTACTTCATCTAAAAGTAAAAAACCACAATTTACCAAGACATGGAATGGTGCACACACTTGGTGACAAGTTTCCCAGATGCAGAAATTCAGCTGACGAAACGTCAGTTGATCTATTCTATGTCTTGTCTCTTTCGGAGGATTCTGTGCATGGTGAAGACATCAATGCATATGAACTTGCATTTGAGAAAGGTGGTCGGGACAACTATAGCTCAAGAAGCTTAGATATGTTGCTGGGAAAGCTTAAATTGGATGATAATCTATCCTTTCATGCTGCCAATGAGATTTCTCCTCAAAACATACAGAGTGCTTCAATTACTTCATTAGATTGGATGGGTGCAGCTCCTGTTGATGTAATTTATAGGTTGAGAACTTTATTATCACATGCTTCTGGGATGTTACTTAGTAGTTATAATCTTCCATTGCCTGGGCATATCCTAATCTATGGACCTCCAGGTTCTGGGAAAACATTATTGGCCAAATTCTCGGCAAAATCTATGGAAGGATGTCTAGATATTCTGGCACATATAGTTTTCATATCTTGTTCTAAACTTGTTTTGGAGAAACCTTCAACAATCCGTCTATCACTTTCCAACTACATTTCTGAAGCTTTAGTTCATGCGCCATCTGTCGTCATCTTGGATGATTTTGATAGCATCATTGCACCT
TCCTCGGACATGGAGAGATCTCAACATTCATCCTCTTCTGCAGCGCTAATTGAGTTTCTTGCTGATATATTAGATGAATATGAGGAAAATTGCAGGAAGATCTGTGGAATTGGTCCCATAGCATTTATTGCTACAGCACAATCGCTGGCTAACTTTCCACAGATCTTGAGCTCTTCAGGGAGATTTGATTTTCATGTCAAGCTGCATGTGCCTGCTGCTGCTGAACGTTCTGCTATACTAAAACATGAGATCAAGAAGAGATCCTTACAGTGCTCTGATGACCTTTGTTTGGATATAGCATCCAAATGTGATGGATATGATGCTTATGATCTGGAAATATTGGTTGACAGATCAGTGCATGCTGCCATTGGTCGTTTACTGTCTGATGAATTAGCCTCTGGAGAAGATGCAAAACATACTTTGGTTAGGGGTGATTTTGTGCAAGCAATGCAAAATTTCCTTCCAGTGGCTATGCGCGACATCACTAAACCAGCCACTGAAGGCGGTCGCTCTGGTTGGGAGGATGTTGGAGGTCTTAATGACATTCAAAATGCTATTAAAGAGATGATTGAGCTACCTTCGAAGTTTCCAAATCTCTTTGCACATGCTCCACTAAGAATGCGATCCAATATTCTCTTATATGGCCCCCCTGGTTGTGGCAAAACACATATTGTTGGTGCCACTGCTGCAGCCTGTTCACTACGGTTCATCTCTGTGAAAGGGCCTGAGTTGCTTAACAAATATATTGGTGCTTCTGAACAAGCTGTCAGAGATATTTTCAGTAAGGCGGCAGCAGCATCCCCATGCATTCTCTTCTTTGATGAATTTGATTCGATCGCTCCAAAGAGAGGGCATGATAACACTGGAGTGACAGATAGAGTGGTCAATCAATTTCTGACAGAGTTAGATGGTGTTGAAGTTTTGACCGGTGTGTTTGTTTTTGCGGCTACTAGTCGACCGGATTTGCTGGATGCTGCACTTCTACGACCCGGTAGACTAGACCGACTCTTATTTTGTGATTTCCCATCACCAGGGGAAAGGTTGGATATTCTTAGAGTCCTTTCTAGAAAGCTGCCAATGGCCAGCGATGTAGACTTGGAAGCCATAGCTTATATGACTGAAGGCTTCAGTGGAGCTGACCTTCAAGCGCTTCTTTCAGATACACAGCTTGAAGCAGTTCACGCGCTTCTGGAAAGCGAAGATGGCGGCGTAATTGGAATGGCACCTGTTATCACAGATGTTCTTTTGAAATCAGTTGCGGCTAAGGCTAAACCATCAGTACCAGAGGCTGAGAAGCAGAGGCTGTATGATATTTATAACCAGTTTCTTGATTCAAAAAGATCTGCTGCTGCACAGTCGAGAGATGGAAAAGGCAAAAGAGCAACCCTAGCGTAG |
Protein: MEFEVRVVGGIESCFVSLPLPLIQTLQSRYLPPILAIELRSSEGRLWHVSWCGSVSSSPSSIEIARQYGGCIGLSDGMGVRVRVVSNLPKATLVTIEPLTEDDWEILELNSELAENAILKQVGIVHEDMRFPLWLHGQTVVMFLVTSTFPQEPVVQLVPGTEVAVAPKTRRTSSIQPPEPEHAMTKVLLRVQDSDSRFICKHDVNGVNVDILFTSGIFIHPETAKNYSLSSLQFVVISPRVLSKKSNKNPHSKTSATEKESNNGNLSNEKGFHEVLFRILLSESVAKGHVMLSQSLRLYLGAGLHSWVYLKRCKIIIKKDIPLVSIAPSHFKIFKKDDVDKSSLDIVNNHENHLQKDELRRSSSNAEMGIRDWSMHEKIVKVFSSLTSFAGAEETTTTIGKTNTENGFVSGISTLLRAWCFAQLNTVASNAADVSSLVIGSKSLLHLKVKNHNLPRHGMVHTLGDKFPRCRNSADETSVDLFYVLSLSEDSVHGEDINAYELAFEKGGRDNYSSRSLDMLLGKLKLDDNLSFHAANEISPQNIQSASITSLDWMGAAPVDVIYRLRTLLSHASGMLLSSYNLPLPGHILIYGPPGSGKTLLAKFSAKSMEGCLDILAHIVFISCSKLVLEKPSTIRLSLSNYISEALVHAPSVVILDDFDSIIAPSSDMERSQHSSSSAALIEFLADILDEYEENCRKICGIGPIAFIATAQSLANFPQILSSSGRFDFHVKLHVPAAAERSAILKHEIKKRSLQCSDDLCLDIASKCDGYDAYDLEILVDRSVHAAIGRLLSDELASGEDAKHTLVRGDFVQAMQNFLPVAMRDITKPATEGGRSGWEDVGGLNDIQNAIKEMIELPSKFPNLFAHAPLRMRSNILLYGPPGCGKTHIVGATAAACSLRFISVKGPELLNKYIGASEQAVRDIFSKAAAASPCILFFDEFDSIAPKRGHDNTGVTDRVVNQFLTELDGVEVLTGVFVFAATSRPDLLDAALLRPGRLDRLLFCDFPSPGERLDILRVLSRKLPMASDVDLEAIAYMTEGFSGADLQALLSDTQLEAVHALLESEDGGVIGMAPVITDVLLKSVAAKAKPSVPEAEKQRLYDIYNQFLDSKRSAAAQSRDGKGKRATLA |
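The Location, Length, and CDS fields of this record can be cross-checked against each other. The sketch below is a minimal illustration, not part of the record: it assumes the coordinates follow the usual feature-table convention (1-based, inclusive ranges) and that the CDS includes one terminal stop codon. The helper names `parse_segments` and `spliced_length` are hypothetical.

```python
import re

# Location string copied verbatim from the Location field of this record.
LOCATION = (
    "complement(join(509877..509915,510326..510610,511493..511610,"
    "512043..512104,512191..512316,512927..513127,513947..514177,"
    "514315..514469,514597..514690,514986..515208,516236..516309,"
    "517564..517655,518289..519059,519463..519918,520717..520816,"
    "521202..521375,521454..521642))"
)
PROTEIN_LENGTH = 1129  # residues, from the Length field


def parse_segments(location):
    """Return (start, end) pairs in the order they are listed.

    Assumption: each 'a..b' range is 1-based and inclusive.
    """
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]


def spliced_length(segments):
    """Total number of bases covered by all segments after splicing."""
    return sum(end - start + 1 for start, end in segments)


segments = parse_segments(LOCATION)
cds_length = spliced_length(segments)
on_reverse_strand = LOCATION.startswith("complement(")

# Assumption: one codon per residue plus a single stop codon.
expected_length = 3 * (PROTEIN_LENGTH + 1)

print(f"segments: {len(segments)}")
print(f"reverse strand: {on_reverse_strand}")
print(f"spliced CDS length: {cds_length} (expected {expected_length})")
```

For this record the 17 joined segments sum to 3390 bases, which equals 3 × (1129 + 1), so the location is consistent with the stated protein length and with the CDS ending in the stop codon TAG.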